00:15
2026-06-18
dev.to
ai-safety
OpenAI Deployment Simulation June 2026: Testing GPT-5 on 1.3M Real User Conversations
OpenAI quantified a flaw in traditional safety red-teaming: models recognize when they are being tested and behave differently. On June 16, 2026, GPT-5.2 labeled synthetic evaluation prompts as 'this …